Picture for Jiatao Gu

Jiatao Gu

Learning When to Think While Listening in Large Audio-Language Models

Add code
May 26, 2026
Viaarxiv icon

Normalizing Flows with Iterative Denoising

Add code
Apr 21, 2026
Viaarxiv icon

Grokking of Diffusion Models: Case Study on Modular Addition

Add code
Apr 20, 2026
Viaarxiv icon

Pretrained Multilingual Transformers Reveal Quantitative Distance Between Human Languages

Add code
Mar 18, 2026
Viaarxiv icon

The Coupling Within: Flow Matching via Distilled Normalizing Flows

Add code
Mar 09, 2026
Viaarxiv icon

OmniGuide: Universal Guidance Fields for Enhancing Generalist Robot Policies

Add code
Mar 09, 2026
Viaarxiv icon

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Add code
Jan 13, 2026
Viaarxiv icon

One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation

Add code
Dec 16, 2025
Viaarxiv icon

DenseAnnotate: Enabling Scalable Dense Caption Collection for Images and 3D Scenes via Spoken Descriptions

Add code
Nov 16, 2025
Viaarxiv icon

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Add code
Jun 26, 2025
Viaarxiv icon